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Abstract 

We take prior-to-crash market prices (NASDAQ, Dow Jones Indus- 
trial Average) as a signal, a function of time, we project these discrete 
values onto a vertical axis, thus obtaining a Cantordust. We study said 
cantordust with the tools of multifractal analysis, obtaining spectra by 
definition and by lagrangian coordinates. These spectra have properties 
that typify the prior-to-crash market situation. Any of these spectra en- 
tail elaborate processing of the raw signal data. With the unprocessed 
raw data we obtain an instability index, also with properties that typify 
the prior-to-crisis market situation. Both spectra and the instability in- 
dex agree in characterizing such crashes, and in giving an early warning 
of them. 



1 Introducion 

We propose to apply some tools of multifractal analysis to the study of market 
crash situations (NASDAQ, Dow Jones Industrial Average). We take the data 
of market price fluctuations on a daily basis and we process it in two ways: 1) 
The prior-to-crash market prices are taken as a discrete one dimensional signal 
— this will be the Cantordust, covered with adjacent boxes k with equal length 
I. These boxes are intervals. The weight Wi of box k will be the number of days 
in which the market prices are in box Z^. The weights Wi are then normalized 
into probabilities p ri , by dividing each Wi by the total T cardinality of the 
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sample. Then, the concentration a of a box Zj is a(h) := t ^ . Therefore, 
a-min corresponds to the heaviest box(es), a max to the lightest one(s). The 
segment [a m i n , a max ] is divided into adjacent intervals of equal length Aa, all 
a's in a Aa are identified, and N a is their number. We define f*(a) := ■ 
This spectrum-by-definition is hereby called f*{a) in order to distinguish it 
from the usual lagrangian f(a) given by the thermodynamic algorithm: a — 

T'(q)J(a) = qa- r(q),q = f(a)J"(a) < 0, and r(q) = ^ r ' , I small, q 

a parameter, -co < q < oo. 

Notice that, in order to obtain f*(a) and/or /(a), we process weights Wi, T, 
and even N a , which is — for instance — always logarithmized and then divided 
by logl/l in order to produce f*(a). 2) In this paper, we — also — use the 
unprocessed data above: the unprocessed w% and the unlogarithmized N a , in 
order to define a market instability index as a quotient Qi that characterizes 
the prior-to-crash market situation. Then we use the processeed f*(a) and f(a) 
in order to strengthen the key properties of Qi that reveal the market collapse 
situation. 

Note : /*(«) and /(a) do not always coincide. 



2 A theoretical framework: the binomial case. 

We consider the simplest of Cantordusts: the ternary K of Cantor in the unit 
interval / = [0, 1]. We endow the subfractals — or "subcantors" — in [0, |] and 
[|, 1] with probability measures p r and 1 — p r respectively, < p r < 1, p r ^ \\ 
p?, p r (l — p r ), (1 —p r )Pr, and (1 — p r ) 2 will be the weights of the 2 2 subfractals 
of width ^2 in the second step of the construction of K, ... and so on. This 
simplest of cases ascribes probability measure ^ _r (l —p r ) r to all subfractals of 
width I = jit in the fcth step, r between and k. 
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Then a (of such subfractal) = ^± = ^=^±0^ = r^- ) + 
iogfi/3) = Ar + B, where A = Ak, and r an integer, r : — > fc. 

The corresponding /(a), the same as /*(a) in this simplest case, is = 

log 3* ' wncre (r) reacncs hs maximum for r = \ and is entirely symmetric for 
r at left and right of k/2. The concentration a goes linearly with integer r from 
an "origin" B (where /(a) = 0) to _B + A|, where /(a) reaches its maximum, 
to -B + A, where f(a) is again zero. In this simplest binomial case, the increas- 
ing and decreasing branches of the curve are a mirror image of each other, like 
an inverted parabola. (In the case we work with below, such symmetry might 
be absent, still we will consider k as a constant, a going linearly with integer 
r : -» k). 

We will simplify this binomial case further, in order to be able to adapt it 
to our study of the market crash signal. The binomial case comes from the 
unfolding of (a + b) k , with a = p r and 6 = l—p r , and the symmetric properties 
of (J). We have a decreasing index for powers of a's — i.e. for p r 's — to wit: 
PrJ Pr 1 ' ■ ■ - coupled with an exponent increase for powers of b — or 1 — p r — : 
(1 — p r )°, (1 — Pr) 1 , • • ■ ; (o)i (i)v ■ ■ the corresponding number of them. 

Notice that the a & f(a) structure above does not need the ternary of Can- 
tor as a starting point: if we divide the unit invertal / in halfs, in quarters, 
. . . where I = is the size of each interval-box in fcth step, then we simply 
replace "3" by "2" in the denominators of a and f(a) above. In fact, if L is the 
number of equal intervals dividing /, we still have a = Ar + B, with I = ^ in 

the denominator of A, L in that of B, and /(a) = t ^/ just as symmetric as 
above. 

Next, consider a number P > 1 instead of p r above, and the binomial expres- 
sion (a + b) k , a = P and 6=1. We now have a decreasing sequence of weights 
w t : P k ,P k ^ 1 , .. ., P k ~ r . . . instead of p k ,p k ~ x , ...;l—p r has been replaced by 
1. The total T weight is now T = (P + l) k . 

Now, in order to make probabilities out of these weights Wi we have to write 
rp^4p > • • • i (p+i)fc i ■ ■ ■ i (r) the corresponding number of them. With L as before, 
k large and fixed, T = (P + l) k the size of the sample, or total number of such 
elements, we still have, as above, a = Ar + B, r : — > k: a = ^~ r ^°gi~ losr = 

log( P +1 )^, gl o g p + ,, og p = rg p + and f{a) = jog) is .; s ; as sym _ 

metric as above. This will be our working structural framework. 



2.1 The instability quotient 

Qi is a function of a or of r indistinctly, and will be, by definition, Qi(a) = 

Qi(r) := J^Z r = pk-r , an obviously increasing function of r, i.e. of a, between 
OL m in — or t = — and a(f m ax) — or r = | in our binomial case. 

Notice that the instability quotient is made up of "unprocessed" weights 
not logarithms of probabilities — and " unlogarithmized" N a , i.e. with the 
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Figure 2: Instability quotient, binomial case 
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Figure 3: Instability quotient, NQ1000 
raw data which, once processed, yields /*(«) and f(a). 

3 Structural frame for the study of market crash 
situations 

NQ1000 refers to the 1000 daily fluctuations before the April 2000 NASDAQ 
crash; DJIA1600, DJIA900, . . .etc. refer to the 1600, 900, . . .etc. daily fluctu- 
ations before the October 1997 Dow Jones market crash, i.e. 1600 days prior 
to crash, 900 days ditto, etc. (Rotundo, 2006). We start with NQ1000, and 
we represent (ditto for other cases) the graph of the closing market price p as 
a function of time t — one day at a time: one day a unit time. Time t the 
horizontal axis, the corresponding price p the vertical one. We take the signal 
(t,p) and project these discrete values onto the vertical axis p. This set of 1000 
points is a cantordust contained in a vertical segment S. We cover vertical S 
with adjacent interval-boxes of equal size I. The number L of boxes should not 
exceed, ideally, the square root of the total T — 1000 points in the signal. Each 
point in vertical S is a certain market price: the higher the point in S — i.e. 
the higher the box k that contains it — the higher the price p. The weight Wi of 
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box li is the number of days in which market prices p fell between the vertical 
bounds or extremes of ij. Thus, each such box represents an increment Ap in 
prices p. Let p~ and p + be the extremes of segment S: the lower and upper 
prices. For NQ1000 as well as for the DJIA's, we always notice a rather small 
set s of very heavy boxes quite near p~ . These s heavy boxes represent many 
days in which market prices stood firm, with constant values, i.e. prices rather 
stable, not very expensive. The short lower subsegment in S between s and p~ 
represents prices going down — hence its smallness: we will do without it, as 
generally, prices tend to go up. Hence, we will work with heavy boxes as the 
lowest li boxes, and study the nature of price increase. The remaining vertical 
upper segment in S, from heaviest boxes at the bottom to the lightest ones 
on top, will be normalized to be the unit segment [0, 1] = /, the cantordust 
thus reduced will be the fractal cantordust K. Since the signal has a total T of 
1000 points, then L — the number of k boxes — should not exceed vTOOO w 32. 
The sum of all weights in all boxes is 1000. The situation at the top, near p + , 
presents a problem: there are boxes with just 1 point, on top of which we find 
a box with points, then, on top, another box with 1 point. . . These isolated 
— and extremely light boxes — correspond to prices quite high. . . and most un- 
stable, for they came to pass during 1 day (the weight Wi of that li box) only, 
a situation made worse by the upper empty box, followed by an even higher 
box again with 1 point — even higher market price, during one day, with an 
increase of 2Ap: skipping the empty box. . . These abrupt jumps in prices not 
only reek of market crash, they pose a geometrical problem, for isolated and far 
away points do not belong to a fractal ... a problem we will deal with below. 
The boxes at the bottom of vertical /, being the heaviest, correspond to a m i n , 
the highest ones, being the lightest, to a max . 

We have a number of things superimposed, illustrated in diagram D (Fig. 1) 
vertical cantordust K in /, the sequence of price increases Ap (one such increase 
for each box li with weight wi), vertical axis p with stable price for lower box and 
higher price for upper one, and the corresponding vertical interval [a m i n ,a max ], 
with a(f max ) somewhere in the middle. 

We will concentrate our study of market crash situation in two separate 
parts: the analysis of what happens between a m i n and a(f max ), and the study 
of the descending branch — ct(f max ) and a max — of the spectrum. 

The number of adjacent intervals Aa, of equal size, covering vertical interval 
[ctmin, a max] should — ideally — not exceed the square root of the number L of 
boxes li covering /. That leaves us with six such intervals Aa, four of which 
cover the interval [a min ,a(f max )]. 

4 Between a mm and a(f max ) 

The value a(f max ) is calculated in two ways that agree: it is the value of a for 
which q = f'(a) = in the lagrangian coordinates. On the other hand, going 
to the spectrum-by-definition, it is in the fourth interval Aa that all values of 
a in that interval cram together around the value for which q = f'(a) = 0, 
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as observed in a histogram that contains all a's — one for each box li. N a for 
that particular Aa is no smaller than for any other Aa, also: its a values are 
the only ones quite near one another, crowded around the value of a for which 
q = 0. 

In the symmetric case above we had a decreasing sequence of weights u>i — 
P k ,P k ~ 1 , . . . P k ~ r . . . with P > 1, each weight corresponded to a certain Aa, 
each Aa was reduced to a point in that ideal case, with N a = ( ) boxes Zj. 

Here we have the lowest Aa, corresponding to a m i n , endowed with N a = 4 
such boxes with weights 154, 142, 84 and 72 days, respectively, with an average 
of 114 days, or 3.5 months. In the same way in which we identify a's in a certain 
Aa — N a the number of such a's — we will average weights ty's of boxes h in 
that Aa. We recall that each Zj box — pertaining to an interval Aa — represents 
a price increase Ap inside pre-determined bounds \p,p'] given precisely by the 
extremes of interval Aa. 

The second Aa, immediately above the lower Aa min , has N a = 7 boxes Zj 
with weights 64,61,50,42,42,42 and 31 days, respectively, with an average of 
47 days or approx. 1.5 months. The third Aa above has an average of 20 days 
or three weeks, and the fourth one, reaching a(f max ), an average of 10 days. So 
we have a sequence of « 3.5 months, slightly more than 1.5 months or 6 weeks, 
w 3 weeks or 20 days, 10 days: it does look like P k ,P k ~ 1 , ... ,P > 1 with k = 6 
and P slightly larger than 2 — indeed, the average of these "P's" is w 2.2; each 
P = pk-(r+i) is taken as the ratio of two consecutive average number of days 
above. (In fact, the two remaining Aa's between a(f max ) and a max do continue 
with this division by ss 2: the average of 5th interval Aa is slightly less than 5 
days, and the last one corresponds to the 1 or 2 days per box.) 

We can now proceed in three different ways that indicate an early warning 
of the market crash situation. We call it "early" because a(f max ) is situated 
far below a max , and a max is at the very top of the market price list: p + . So 
ot{fmax) is midway between p~ and p + , far below p + . The logic is that before 
prices reached p + values, on their way up, they passed through these critical 
values corresponding to a(f max ). hence, what happens at this value is an early 
warning of future market collapse. 

4.1 The instability index 

In Sec. 2.1 we defined the instability index Qi(r) = p N k t r , r = 0,1,2,... We 
recall that P > 1, that k was fixed, that a = Ar + B, so we can write Qi{a) 
or Qj(r); that for the binomial case Qi(r) is steadily increasing with r (or a), 

and that WG &X6 between Otmin 

(r = 0) and a(f max ), which means r = §. In 
this last ideal case, Qi{r) grows with no change of curvature between r = and 
r = \ (Fig- 2, out of scale). In the case of our NASDAQ signal, the weights 
P k ,P k - 1 , ... are 114, 47, 20 and 10 days (Sec. 4, with a value of P « 2.2) until 
we reach a(f max ); the four N a being 4,7,5, and 7 boxes in the four intervals 
Aa (Sec. 3) between a m j„ and a(f max ). The graph Qi(r), in this case, plotted 
against four integer values of r, shown in Fig. 3 (out of scale), has an inflection 
point, between a m i n and a(f max ), near the latter. 
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Note: the values AT* = 4, AT| = 7, N% = 5, AT* = 7, do not imply that the second 
and fourth Aa have maxima, for in the fourth Aa the a's are concentrated 
around f'(a) = 0, whereas in the second Aa they are dispersed. 

Let us intcrpet Qi. The first Q\ = j-jj = 0.035 is much smaller than the 
second one: ^ = 0.148: the instability increased. In the first instance, prices 
increased every 3.5 months, a rather stable situation, and they did so for 4 
consecutive Ap price increases. Next, the prices began to increase Ap each 
month and a half, a situation that repeated itself seven times, that is why Qi 
increased. The inflection point above is due to an anomaly in the evolution 
of the market: 5 consecutive price increases Ap each 3 weeks are immediately 
followed by price increases each 10 days only! . . . and this abnormal jump takes 
place 7 consecutive times... The logic underneath is the following: we had a 
price increase each month and a half, 7 times. Next, we find a Ap increase each 
3 weeks, which is a much worse situation. But, if this situation is repeated a 
rather-large-than-7 times instead of 5 times, there is room for the system to 
adapt, there is valuable time to adjust to the new situation before time intervals 
between consecutive Ap shorten again. This numerical anomaly in the values 
of N a causes the inflection point in the graph of Qi (a) . 

So, real stability of the system does depend on Qi remaining low, but if it 
grows, the occurrence of a crash depends on the geometry of its graph (inflection 
point), as described above. 

4.2 The second way to study crash early warning: the 
lagrangian spectrum 

We continue with NASDAQ, we take boxes k with weight Wi, divide each Wi by 
T = 1000 and obtain probabilities p r . . Then we can apply the thermodynamical 
algorithm and obtain /(a), as shown in Fig 4. Two observations: a) the curve 
(a, /(a)) is not symmetric — as is the binomial one; b) there is a noticeable gap 
between f'(a) — 1 and f'(a) = — entirely in the interval [a m in, ce(fmax)]- 

Since f'{a) — means a = a(f max ), we observe that the inflection point 
of Qi, very near a{f max ), is located precisely in that gap. If we consider a 
time span where there is no important crash, e.g. the daily fluctuations of Dow 
Jones from 1st Jan. 1990 to 1st Jan. 1994, the corresponding curve (a, f(a)) is 
symmetric, as shown in Fig. 5. It should be noticed that symmetric spectra are 
quite rare. This case is, as far as we are aware, unique in empirical context — the 
binomial case is a purely mathematical one. Moreover, if we take the data from 
the DJIA1600 sample, still the corresponding (a, /(a)) curve mantains both the 
gap and the asymmetry (Fig. 6): enlarging the prior-to-crash sample does not 
"dilute" these crash features. 

Further, if we do the opposite, that is, take smaller and smaller samples prior 
to crash: DJIA900, 800, 700, 600, always maintaining L = \fT (L = number of 
li boxes, T = size of the sample, all as above), we obtain the same f(a) as in 
Fig. 4 — with minute variations (Fig. 7): the process is strictly self similar (i.e. 
independent of size sample, or of scale) around market crash point. Note : In 
order to plot the thermodynamical lagrangian spectra curve (a,f(a)) in each 
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Figure 4: Thermodynamical formalism, NQ1000 
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Figure 5: Symmetric thermodynamical formalism, non-crash situation 
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Figure 6: Thcrmodynamical formalism, DJIA1600 
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Figure 7: Thermodynamical spectra for DJIA900, 800, 700, 600 
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case throughout this section, we have used only integer values for q = f'(ot), 
since they are enough to determine the key properties of /(a): the graph dots for 
q = 0, + 2,— 2>+l) — 1>+!> — §)■•■) would be deceiving, since the corresponding 
graph points at the extremes of the curve would simply pile up on top of each 
other, hence the upper gaps would fill up artificially, without adding any real 
meaning to the geometry of the lagrangian process. 

4.3 A third way to study the early warning gap: the f*(a) 
spectrum 

We continue with NASDAQ data. Recall that T = 1000, and that we have 
between 30 and 40 a's, from boxes k grouped in 6 intervals Aa, the first four 
span [ce m i n ,a(f max )], the a-interval of our study. We also recall that f*(a) is 
the spectrum-by-dcfinition and /(a) the lagrangian one. The first Aa interval, 
the one corresponding to a min , has 4 a's, more or less distributed evenly in 
the interval. The second adjacent Aa (seven a's) has its a's not so dispersed, 
some are near the a for which q = /'(a) = 1. The third Aa has very few 
and dispersed a's. So /*(a) and /(a) share the same a min , then /*(a) grows 
to a local maximum at an a for which /'(a) = 1,. . .then it goes down, and it 
does it in the gap between /'(a) = 1 and /'(a) = 0. The fourth interval Aa 
includes the particular a for which f'(a) = 0. All its a's are clustered around 
this particular a, which implies that a spectrum (another one) /* should have 
an absolute maximum at that value of a, just as /(a) does — in the 5th and 6th 
remaining Aa intervals the number of a's steadily decreases, and so does this 
last f*(a). 

The picture corresponds to two /*(a): fi(a) starts in a m i n , reaches a max- 
imum a.\ for q = /'(a) = 1, and descends in the gap between q = 1 and q = 0. 
The other, /|(a), goes from its own maximum at ao (for which q — .f'(a) = 0) 
and decreases as a nears a max (shared by /1(a) and /(a)). By considering a 
larger sample prior to market crash, DJIA1600, we have one more Aa, which 
clarifies the situation: /|(a) starts with a low value in the gap between a\ 
and ao (i.e. q — 1 and q = 0), then grows to an absolute maximum at ao, 
where /'(a) = q = 0, and then decreases as a nears a max : clearly, two different 
/* 2 (a) (with negative second derivative), which seem to collide in "the gap"; 
Fig. 8 shows a schematic diagram representing this situation. The two /^(a) 
multifractal spectra — bi-multifractality — are two curves for which /(a) acts as 
an envelope, as a smooth upper "roof of both. According to Radon and Stoop 
(1996), this situation — bi-multifractality with an upper envelope — occurs when 
two different fractal measures are processed with one and the same multifractal 
algorithm. 



5 The situation between f'(a) = and a max 

As a approaches a max , the market prices increase more and more, to the point 
in which they can be sustained during one day only. This situation, as we 
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Figure 8: Diagrammatic spectra /* and /| 



noticed before, is reflected in boxes with 1 or points (days) near the p + top. 
We have several such boxes at the top prices, separated from each other by 
zero point empty boxes. This situation, which we call "dispersion", depicts a 
number of isolated points far away from the main body of the fractal cantordust, 
and, from the point of view of fractal geometry, they should be discarded — no 
isolated point contributes dimensionally: that is why we used several times the 
expression "between 30 and 40 boxes", for quite some such boxes have to be 
discarded when doing a spectrum-by-definition f*(a). Instead, the lagrangian 
f{a) process does not seem to care where the boxes are: isolated and far away 
or in the very midst of the cantordust, but how many they are, for each value 
of a, regardless of their location. Of these two spectra, we choose f(a) in order 
to interpret and quantify the dispersion, since it takes into consideration all the 
dispersed boxes near a max . Near crash, prices jump so high, and so high, that 
we have many dispersed boxes: N amaa . is large, and so is f(a max ), as we observe 
for NASDAQ crash in Fig. 4, where the large dispersion in a max substantially 
contributes to the non-symmetry of the curve. 

Carefully notice that, when we enlarge the prior-to-crash sample, e.g. DJIA 
1600, Fig. 6, we slightly "dilute" the crash effect on the descending right branch 
of the corresponding /(a), for f{a max ) is much smaller — and the dots closing 
the right branch are much more separated than in the NASDAQ case, though 
they are drawn with the same parametric q values as for NASDAQ. Also, Fig. 
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7 shows that as we take the sample nearer and nearer crashpoint, DJIA900, 
800, 700, 600,. . .the higher the value of f(a max ): the highest one corresponds 
to DJIA600 (the numerically smallest sample which still yields a cantordust). 
The economic interpretation: those extraordinarily high jumps in market prices 
— the dispersion — increase as the system destabilizes, as we get dangerously 
near its collapse. 

Note : An observation on the inflection point in Qi (Fig. 3): Qi grows 
between a m m and a(f max ), since the numerator N a grows, and the denominator 
is a decreasing power. When /j" grows (up to q = 1) then Qi grows, ditto when 

grows . . . but, in the middle of these two growing spectra, exactly between 
q = 1 and q = 0, Qi does not grow . . . but sits down, hence the inflection. 

6 Conclusions 

A prior-to-crash market signal is studied as a cantordust, its fractal properties, 
responsible for the crash situation, are studied with the tools of multifractal 
analysis. Two different multifractal spectra (a, /(a)), and an instability quo- 
tient Qi characterize and quantify an "early warning" of market collapse, with 
market prices (not yet very high) corresponding to values of a between a m i n 
and a{f max ); the study of the highest market prices, corresponding to values 
of a near a max , is done by analysing the fractal dispersion of the cantordust 
prior-to-crash market signal. 
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